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Abstract 

We show that the top eigenvalue of an n x n random symmetric Toephtz matrix, 
scaled by \J2n log n, converges to the square of the 2 — )• 4 operator norm of the Sine 
kernel. 



1 Introduction 



An n X n symmetric random Toeplitz matrix is given by 



ao 



ao 
: a\ 
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{{.(^\i~j\))o<i,j<n- 



where (aj)o<i<„_i is a sequence of independent random variables. This article establishes the 
law of large numbers for the maximum eigenvalue of this matrix as n — >■ oo. The study deter- 
minist ic Toeplitz operators has a rich theory. Se e the classical boo k by lGrenander and Szego 



(Il984l ) or more recent ones by Bottcher et. al. (Il999l . 120001 . l2006l ). In contrast, the study of 
random Toeplitz matrices is a relatively new field of research. The question of establishing the 
limiting spectral distribution of random Toeplitz matrices with inde pendent entries was first 
posed in the review paper by iBail (119991 ). The answer was given byO 

using method of moments. Since then the study of asymptotic d istribution of eigenval 



ues o: 



fl2005h . 



beplitz matrices has attracte d cons id erable attention e .g., see 



Bose and SenI (|2008| ). iKarginI (120091 ) . IChatterjed (120091 ) and the references therein. 



Hammond and Miller 
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The problem o f studying th e maximum eigenvalu e of random Toeplitz matrices is raised in 



( iBryc et al. 



20061 Remark 1.3). iBose and Sen! (120071 ) established the law of the large numbers 



for the spectral norm of Toeplitz matrix when the entries are i.i.d. with some positive mean 
and finite variance. But pinpointing the exact limit for the spectral norm of the Toeplitz 
matrix when there is no perturbation, that is, when the entries are mean zero turned out 
to be much more challenging. This is partly due to the fact unlike the Wigner case where 
the limiting spectral distribution, the semicircular law, has a compact support and the top 



Bai and Yin 



eigenvalue converges to the right end point of the support (this was proved by 
19881 ). the limiting spectral distribution of Toeplitz matrices has infinite support. As a 
result, there is no natural guess to begin with. Another difficulty is that currently there are 
no useful estimates available for the trace of high powers of the Toeplitz matrix tr(T^) when 
k = k(n) g o es to infinity. 



Mecked (|2007| ) showed that if the entries have zero mean and uniformly subgaussian 



tail then the expected spectral norm of an x n random Toeplitz matrix is of t he order of 



Adamczak 



nlog n, a significant departure from the standard ^/n scaling of the Wigner case. 
( I2OIOI ) shows concentration, more precisely result that the random Toeplitz matrix divided 
by the expected spectral norm converges almost surely t o 1 if t he entries are i.i.d. with zero 



mean and finite variance. 



Bose. Subhra Hazra and Sahal (l201ol ) gave an upper bound and a 



lower bound for the right tail probability of the spectral norm of Toeplitz marix scaled by 
n^/" when the entries are i.i.d. random variables belonging to the domain of attraction of an 
a-stable random variable with index < a < 1. 

Throughout the paper, we will have the following standing assumption on the entries of 
our random Toeplitz matrix. 

Assumption. (aj)o<i<„_i is a triangular array of independent random variables (we sup- 
press the first index). There exists constants 7 > 2 and C finite so that for each variable 



Eai = 0, Ea^ = 1, 



and 



E|a,-r < C. 



Define the integral operator corresponding to the sine kernel by 
Sin(/)(a:) := / ^^^(^(^ " ^)) for / G 



7r(x - y) 



and its 2 — )■ 4 operator norm as 



|Sin||2^4 := sup ||Sin(/)||4 

II/I|2<1 
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where := {J^ \f{x)\^y^^ denotes the standard LP-norm. ||Sin||2^4 is the solution of the 
convolution optimization problem (see the Appendix): 

sup{||/ -k /II2 : / even of L^-norm 1 supported on [—1/2, 1/2]} (1) 

For a Hermitian matrix A, we denote by Ai(A) and A„(A) the maximum and minimum 
eigenvalue of A respectively. The following theorem is the main result of our paper. 

Theorem 1. Let T„ be a sequence of n x n symmetric random Toeplitz matrix as defined 
above with its entries satisfying the above assumption. Then 

^i(Tn) :^ ||sin||^^4 = 0.8288... as n ^ 00. 
y/2n logn 

Recall that a sequence of random variables converges in to a constant c, denoted by 

r p 

Xn ^ c, if E|X„ - c\P 0. 

Remark 2. Note that L'^ convergence is as best as we can hope for in Theorem [1] This is 
because of the fact that maximum eigenvalue of a symmetric matrix dominates the diagonal 
entries. So Ai(T„) > and E|Ai(T„)|^ can be infinite for any p > 7. 

By symmetry, the same theorem holds for — A„ and so for the spectral norm ||T„||sp = 
max(Ai, — A„) as well. 



1.1 Connection between Toeplitz and Circulant matrices 



The starting point our analysis of the maximum eigenvalue is a connection between a Toeplitz 
matrix and a circulant matrix twice its size. 

Observe that T„ is the n x n principal submatrix of a 2n x 2n circulant matrix C2n = 
mod 2n)o<jj<2n-i, whcrc bj = Gj ioT < j < u and bj = a2n-j for n < j < 2n. We hope 
to relate the spectrum of Toeplitz matrix to that of the present circulant matrix twice its 
size which can be easily diagonalized as follows: 



(2r2)-i/2c 



2n 



Uo„dLu^ 



where is the discrete Fourier transform, i.e. a unitary matrix given by 

1 



U2n(j, k) 



'2n 



exp 



/ 27Tijk\ 



<j,k <2n-l 



and 



diag((io, rfi, . . . , rf2n-i), where 
^" ^ / 2TTijk\ 



1 2mjk\ 1 



n-1 

ao + {-lybn + 2 ^ Ofc cos 

k=l 



f 27T jk\ 



(2) 



Clearly, dj = d2n-j for all n < j < 2n. If we write 

In Or 



Q 



2n 




then Tn and Q2nC2nQ2n have same nonzero eigenvalues by our observation. Moreover, the 
matrix (2n)~^/^Q2nC2nQ2n has the same eigenvalues as its conjugate 



{2n)-'/^lJl^Q2nC2nQ2n'U2n = P2nDLP 

where 



2n 



2n 



ULQ2nU2„ (3) 



Consequently, we have a useful representation of the maximum eigenvalue of the Toeplitz 
matrix 

Ai(r2-V2T„) = y2Ai(P2.DLP2n). (4) 

as long as the right-hand side is not zero. We point out here that the matrix Q2nC2nQ2n 
(and so P2„D2„P2n) does not depend on the value of 6„. For notational simplification, 
we will drop the subscript 2n from the relevant matrices unless we want to emphasize the 
dependence on n. 

We have thus reduced our problem to studying the maximum eigenvalue of the matrix 
PD'I^P where P is a deterministic Hermitian projection operator and D''' is a random multi- 
plication operator defined on C^'^. One can view this representation as a discrete analogue 
for Toeplitz operator defined on the Hardy space H^. 

As we will see in Section [2] that (after suitably choosing the auxiliary random variable 
bn and the variance of ao) the algebraically diagonal entries of D''' are uncorrelated. In the 
Gaussian case we get independence, which makes the analysis much easier. In the general 
case, to analyze the lower bound, we need to use an invariance principle. The difficulty is 
that the top eigenvalue does not come from the usual central limit theorem regime of D''', 
but from modera te deviations. We overcome this by extending the invariance principle of 



Chatterjed (l2005l ) (based on Lindeberg's approach to the CLT) to the realm of moderate 



deviations. 



1.2 Heuristics and conjectures 

We first give a heuristic description of the origin of the limiting constant. Since P is a 
convolution with a decaying function, the matrix PDP in many ways behaves a like the 



4 



diagonal matrix D itself. In particular, its top eigenvalue comes from a few nearby extreme 
values of D. We can partition of the interval into short enough sections J, and show that the 
top eigenvalue is close to the maximum top eigenvalue of blocks P[J]D[J]P[J]. This likes to 
be large when the entries of D[J] are large. By large deviations theory, the real constraint 
on the best D[J] is that the £^-norm the vector of entries is bounded by a/2 logn. 

Now the top eigenvalue is an optimization problem, but now we have another £^ 
optimization over the entries of D[J]. Thus an £^-norm appears, and that the solution of 
these two problems together are asymptotically given by the 2 — )■ 4 norm of the limiting 
operator of P2n, which we then relate to the well-known Sine kernel. The relation is natural, 
since and the Sine kernel are both projections to an interval conjugated by a Fourier 
transform. 

Before we prove Theorem [1], let us state some conjectures and open questions. Each of 
these conjectures can be split into parts (a) the Gaussian case, and (b) the general case 
where suitable moment conditions have to be imposed. 

Conjecture 1. Let Vn be the top eigenvector o/PD'''P. Then there exist random integers 
Kn so that for each i E we have Vn{Kn + i) — )■ g{i), where g is the Fourier transform of 
the function g(x) = \/2f(2x — 1/2) and f is the (unique) optimizer in ([T]). 

Conjecture 2. With high probability, all eigenvectors o/PD"''P are localized: for each eigen- 
vector, there exists a set of size n°^^^ that supports 1 — o(l) proportion of the i'^-norm. 

Question 3. What is the behavior of n^^^'^Xi{Tn) — ■\/21ogn||Sin||2_>.4 ? 

Conjecture 4. The top of the spectrum of suitably shifted, and normalized, converges 
to a Poisson process with intensity ce~^^ for some c, > 0. 

Related to this, we have the following. 

Conjecture 5. The top eigenvalue of T„, suitably shifted and normalized, has a limiting 
Gumbel distribution. 

Conjecture 6. The eigenvalue process ofTn, away from the edge, after suitable normaliza- 
tion, converges to a standard Poisson point process on M. 

1.3 The proof of Theorem [1] 

In this section, we break the proof of Theorem [1] into its components. This also serves as a 
guide to the rest of the paper. 
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Proof of Theorem [II We start with the representation ([ 

Ai(n-i/2T„) = v^Ai(P2„dLp 



2n 



In formula ([2]) for diagonal entries of Dg^, we replace by V^aQ (this is legal via Lemma [5]) 
and choose 6„ = v^^n where a„ is an identical copy of independent of (aj)o<j<n (recall 
P2nD2„P2n does not depend on 6„) to obtain a new diagonal matrix D2„. 

In Section m Corollary [6] we show that we can assume that the a„ are bounded by n^^^. 
In Lemma [71 we establish tightness, so it suffices to show convergence in probability. In 
Section [31 equation fllSp we introduce a sparse version of the diagonal matrix D, whose 
top eigenvalue is close: 



|Ai(PDP) - Ai(PD^P)| < eV21ogn. 

The matrix is sparse enough that the whole question can be reduced to a block- 
diagonal version, where the blocks are determined by a random partition A of {1 . . .2n}. 
This is done in Lemma [9] with high probability. 



Ai(PD^P)- max Xi(P[J]B'\J]P[J]) 



0(1). 



where the parameter J refers to the minor corresponding to the subset of indices J. In 
Propositions [10] and [HI (Sections HI [5]) we then establish the asymptotic upper and lower 
bounds for the for the block diagonal version. Together, they give that with high probability, 
there exists e„ — )■ so that 

max Ai(P[J]D-[J]P[J]) = ./2\^{ml^, + o{l)) 
JeA:Jn57^0 

where 11 is the n — i- oo limit of P introduced in (fT3!) . Finally, in the Appendix, Lemma [19] 
we identify ||II||2^4 = -^||Sin||2_^4. This completes the proof of Theorem [1] □ 



1.4 Notation 

We write that a sequence of events {£n)n>i occurs with high probability (with high probability) 
when P{£^ri} 1- Let £^(C) (resp. £^(M)) be the space of square summable sequences of 
complex (resp. real) numbers indexed by Z. For square matrix A and a subset T of the 
index set, we denote by A[T], the principal submatrix of A which is obtained by keeping 
those rows and columns of A whose indices belong to T. We consider n as an asymptotic 
parameter tending to infinity. We will use the notations f{n) = Q{g{n)) or g{n) = 0{f{n)) 



6 



to denote the bound g{n) < Cf{n) for all sufficiently large n and for some constant C. 
Notations such as /(n) = u{g{n)) or g{n) — o{f{n)) means that g{n)/ f{n) — >■ as n — >■ oo. 
We write f{n) = Q{g{n)) if both f{n) = 0{g{n)) and g{n) = 0{f{n)) hold. 

2 Truncation and Tightness 

2.1 Truncation and changing the diagonal term 

For n > 1, define two arrays of truncated random variables by 

tti = af^ -. ail{|„.|<„i/^} - E[ail{|„.|<„i/^}], < i < n - 1 

and 

Qi — a-"^ := Var(ai)~^/^ai, < i < n — 1. 

We sometimes write T„(a) for T„ to emphasize its dependence on the underlying sequence 
of random variables (ai)o<i<n-i- Thus T„(a) and T„(a) denote the Toeplitz matrices built 
with random variables (ai)o<j<n-i and (aj)o<j<n-i respectively 

The next lemma says that the above truncation and the rescaling of the underlying 
random variables has a negligible effect in the study of the maximum eigenvalue of Toeplitz 
matrix. 

Lemma 3. We have, as n —>■ oo, 
(a) 

Ai(T„(a)) - Ai(T„(a)) L-y ^ 
\J n log n ' 

(b) 

Ai(T,(a))-Ai(T,(a)) ^ 
■\/nlogn 

Proof. Define 

«i := «il{|ai|<ni/7}, 0<i<n-l. 
Recall that for a matrix A, its spectral norm satisfies 

||A||gp < max^ \A{k,l)\ x max^ \A{k,l)\. (5) 
I ' fe 

In the special case when A is Hermitian, the above bound reduces to 

||A||sp < max V|A(A;,0|. (6) 

k — ^ 

I 
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Then we have 

Ai(T„,(a))-Ai(T„(a)) 



y/n\ogn 



^ ||T„(a)-T„(a,„., ^ 



n\^J \\sp ^ 2 X]j=o l'^j|l{|ai|>nl/7} 



n log n 



\/n log 



(7) 



which follows from the bound (|6]) and from the fact £^ norm of the each row of the Toeplitz 
matrix T„(a) — T„(a) is bounded by 2^"^^ |aj|l||„.|>„i/7|. We quote a standard moment 



bound for sum of independent nonnegative r andom varia 
thal's inequality in the literature (see, e.g. 



Latala 



1997 



3les, commonly known as Rosen- 
Corollary 3) which says that if 



are independent nonnegative random variables and p > 1, then there exists a 



universal constant Cp such that 



IL I IL It 

i=l \ 4=1 i=l 

Clearly, E[|ai|'''l||a.|>„i/7}] < E[|aj|'^] < C. On the other hand, by Holder's inequality 
and by Markov's inequality, this is bounded above by 
Therefore, by Rosenthal's inequality. 



E 



n-l 



i=0 



Combining ([7]) and OH]), we obtain 

|Ai(T„(a))-Ai(T„(a)) ^ 



E 



yjn \ogn 



Oin) 



0{n) 



{n logn)T/^ 



^ 0. 



Next we see that 



E 



Ai(T„(a))-Ai(T„(S)) 



^/n \ogn 



< 



EfllTja) -T„ra)||X 



spj 



(ralogn)^/^ 



which completes the proof of part (a). 
For part (b) we want to bound 



(nlogn)"'/'^ 



Ai(T„,(5))-Ai(T„(a)) 



A/n log 



n 



< 



|T„(a)-T 

a/""- log 



n(a)llsp ^.^/^ ||P(Dt(a)-D(a))P||,p ^ 



n 



yiog 



n 



(9) 
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Here we use the representation (jl]). Since P, being Hermitian projection matrix, has spectral 
norm equal to one, with a* = a — a we have 

||P(Dt(a) -D(a))P||sp = ||PDt(a*)P||3p < ||Dt(a*)||sp = max \d,{a*)\. 

0<j<n 

We have 

1 - Var(ai) = E[a,2l^|^^|>„i/^g + (E[ail{|a,|>„i/7}])^. 
We apply ([HD, and the similarly derived 

to get 1 — Var(a.j) = O^n^^^"^^^) uniformly in i. Note that a* = (1 — Var(aj)~^/^)aj which 
implies Ea* = and Var(a*) = 0(^-^+2/7) ^nd \a*\ < 4n^/^. 

Using the union bound and the identity K[X] = F{X > t}dt which holds for any 
nonnegative random variable X, we can write 



E 



max \dj{a*)\^' 

0<j<n 



<1 + ^ P I max |rfj(a*)|^ > tjc/t 

POD 

<l + (n + l)max/ F {\dJa*)\ > f/^] dt. (10) 

0<i<n J I 

If ^2; • • • ; be independent mean zero random variables, uniformly bounded by M, then 
the classical Bernstein inequality gives the following tail bound for their sum, 

" / /2 \ 

Using Bernstein's inequality, we obtain 

([lOD < 1 + (n + 1) / 2 exp ( ^. .j/'!^^] ^ dt 

POO 

< 1 + 2(n + 1) / exp {-t'/^ ■ f](ni/2-i/7)) dt = 1 + n ■ ^7/2-ig-f^(nV2-i/.)_ 



Hence, (logn) ^/^||D(a*)|| ^0 and so 



Ai(T„(a))-Ai(T„(a)) 



— )■ 0. This completes the proof of 



V ri log n 

part (b) of the lemma. □ 

Definition 4. Let T° be the symmetric Toeplitz matrix which has \/2ao on its diagonal 
instead of oq. 



Lemma 5. We have, as n ^ oo, 



Ai(T°)-Ai(T„) 
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Proof. The proof is immediate from the following fact 

E||T° - T„||Jp <{V2- iyE[\ao\''] = 0(1). □ 

Corollary 6. It sujfices to prove TheoremUlfor the symmetric random Toeplitz matrix T° 
defined in [7] where the random variables ai are independent mean zero, variance one and 
bounded by n^^^ . 

Proof. The proof is immediate from Lemma [3] and Lemma [5l □ 
Following (j4]), we can write 

\i{n-^'^Tl) = v^Ai(PDP), (11) 
with P as before and the entries of the diagonal matrix D = diag(c/o, (ii, . . . , d2n-i) given by 



V2ao + (-l)^V2a„ + 2 ^ cos 



k=l 



0<j< 2n, 



where bn '■= V^an, an being an independent copy of ao- The reason for choosing the 'right' 
variance for diagonal and the 'right' auxiliary variable 6„ is that now the variables dj,0 < j < 
n becomes uncorrelated (see Lemma [T8|) . Thus in the special case when {aj : < j < n} are 
i.i.d. Gaussian random variables with mean and variance 1, it follows that {dj : < j < n} 
are again independent Gaussian with mean and have variance 1 expect for d^ and dn who 
have variance 2. 



2.2 Tightness 



Lemma 7. For each n>l, let ao, ai, . . . , a„_i be a sequence of independent random variables 
that have mean zero, variance one and are bounded by v}!^ . For any p > 0, we have 



supE 

n>l 



Ai(T^ 



y/2n \ogn 



< oo. 



Proof. The proof is a direct application of Bernstein's inequality similar to what we did in 
the proof of part (b) of Lemma [3l From the representation ( fTTj) . we know that ?2^^/^Ai(T°) 
is bounded above by a/2 ■ maxo<j<n \dj\. Therefore, for any a > 0, 



E 



' Ai(T°) 
y/2n logn 



< E 



/ maxo< j<„ \d/^^' 



log n 



< a+(n+l) max 

0<j<n-l 



P 



\dj\ 



y/hgn 



> t^/P I dt. 
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By the Bernstein inequality, for n sufficiently large, 



max P < " > ^ < 2exp 



o<j<n {^/\ogn J \^ 2 + iy2nV7-i/2yi^.ti/p 

which implies that there exists a constant c > such that each n and each < j < tt,, 

f P l-j== > t^^"] dt< [" exp (-ct^/P ■ logn) dt + [ exp (-ct^/^) dt 

J a I V log J a -/in 

where t„ := n"'^'^^^{\ogn)~"'^'^. The second integral above goes to zero faster than any 
polynomial power of n whereas by choosing a sufficiently large we can make the first integral 
0{n~^). The claim of the lemma follows. □ 

3 Reduction to block diagonal form 
3.1 Some facts about P 

By the definition (jS]) P : C^" — t- C^" is a Hermitian projection matrix. The action of operator 
P can be described by the composition of the following three maps: For X G C^", we first 
take discrete Fourier transform of x, then project it to the ffist n Fourier frequencies and 
finally do the invert discrete Fourier transform. 
The entries of P are given by 



P{k,l) 



1 for = /, 

for k ^ l,\k — l\ is even, 

1 X. 1 



for \k — l\ is odd. 



l_exp ^-2M) 

Note that P{k, I) is a function of (k—l) only and that |P(fc, Z)| < Ci min {\k — l\,2n — \k — 

k 7^ / for some constant Ci. Hence, the maximum of norms of the rows or the columns of 

P has the following upper bound: 

2n-l 2n-l 

max V |P(fc,/)| < Calogn, max V |P(fc, /)| < logn, (12) 

k ' ^ I ' ^ 

1=0 k=0 

where C2 is some suitable constant. 
Limiting operator for P 

Let T be unit circle parametrized by T = {e^™ : x G (-1/2,1/2]} and ^^(T) := {/ : 
[-1/2,1/2] ^ C : /(-1/2) = /(1/2) and \f{x)\'^dx < 00}. We define an projection 
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operator 11 : £^(C) — > £^(C) as the composition of the following operators: 

n : f{C) A L\T) ^^^'^ L^T) ^ ^^^C), (13) 

where t/j : £'^{C) — >■ L^(T) is the Fourier transform which sends the coordinate vector e^, m e 
Z to the periodic function x i-)- e^'^*"*^ e -^^(T), ■?/^~^ is the inverse map of ip and X[o,i/2] '■ 
L^(T) — > L^(T) is the projection map given by / ^-)■ / ■ 1 [0,1/2]- The operator IT is Hermitian 
and is defined on the entire £^(C) and hence is self-adjoint. It is easy to check that for any 
k,leZ, 



{ek,nei) =: n(A;, /) = 




if A; = / 

if k ^ l,\k — l\ is even 
if I /c — / I is odd. 



Here (■, ■) denotes the usual inner product on £^(C). 
Define 2^4 operator norm of 11 as 

||n||2^4 := sup{||nv||4 : V G ^'(C), ||v||2 < 1}, 

where for any vector v e £^(C) and p> 1, ||v||p denotes the standard £p norm of v. 

Check that for any < k,l < 2n satisfying \k - l\ = 0((logn)^), |P2„(A;,0 - ^{k,l)\ 
converges to as n — > 00. Moreover, using the fact that the following hmit exists and is 
finite 

1 — sinx/x + i(l — cosx)/x 1 

lim -^^ — = -, 

x-i-o (1 — cos + isinx 2 

one can deduce that the rate of convergence 

|P2n(A;,0 -n(A;,OI = C>(n~^) as rn^ 00 and |A; - Z| = 0((logn)^). (14) 

3.2 Allowing e room 

As one might guess, the diagonal entries of D which have small absolute value should not 
have too much influence on determining the value of Ai(PDP). In this subsection, we will 
make the idea precise. For e > 0, consider the random set 

S = {0<j <2n-l : \dj\ > 6^2 log n}, 

let R :=diag(lje5), and let := DR. Then 

Ai(PDP) _ Ai(PD^P) 
V2 log n V2 log n 



lim — 

x^O X 



X 



1-e- 



+ i 



< 



P(D-D^)PLn P SD D-D"Ln P sn . ^ 

V21ogn ~ V21ogn ~ ^ ^ 
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3.3 Random partition of the interval 

For a set -B, we denote by jj^B the cardinality of B. 

Set r = [logn]'^, and let m = [n/r\ — 1. Divide the interval {0, 1,2, . . . ,2n — 1} into 
2m + 1 consecutive disjoint subintervals (called bricks) L_m, ■ ■ ■ , L_i, Lq, Li, ... , in a 
way that 

• the length of each Li is between r and 4r, and 

• the subdivision is symmetric: Li = 2n — 1 — L_i. 

Let C be the collection of nonempty sets consisting the intervals formed by taking union 
at most M = M(e) := 4 + 12/e^ consecutive intervals either from L_m+i, • • • , or 

from Li, L2, . . . , Lm-i- We will call an element of C as an admissible block. Note that if 
L & C, then size of L is bounded below and above by r and Mr respectively. 

We define a brick L^ to be visible if LCiS 7^ 0. Otherwise L^ is called invisible. Clearly, for 
1 < k < m, Lk is visible if and only if is visible. Partition the set {0, 1, . . . , 2?2 — 1} into 
disjoint random intervals J G A by subdividing it between each pair of consecutive invisible 
bricks. Note that in this random partition, each interval starts and ends with a "gap" of size 
at least (logn)^. More precisely, for any [a,b] G A, we have [a, a + (logn)'^] fl 5 = (unless 
a = 0) and [b - {\ognf, 6] n 5 = (unless b = 2n - 1). 

Proposition 8. With hight probability, the following holds: 

L If J e A, and J n S 7^ 0, then J E C and 

2. #(Jn5) < M for allJ e C. 
Proof. For any fixed s > 1 and < ji < < • • • < < 

s 

P{|4I > ev/21ogn, l<i<s]< Yl > se^2\ogn] 

fte{-i,+i} i=i 

e^s^ ■ \ogn 



- ^ ''''^ ' Var(^^^, + 0(snV7-i/2) . se^miE, 

< 0{n-'^'/^). (16) 

The second inequality in (|T6|) is a consequence of Bernstein's inequality and for third inequal- 
ity above we used the fact that Var(^*^^ Af^jJ = Yld=i Var((ij.) < (s + 2) (by Lemma [T8|) . 
Note that if i or ^ fails, then one of the following events holds. 
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(i) either of the bricks Ln and L_m is visible. 



(ii) there exists a stretch of M consecutive bricks from L^^+i, ■ ■ ■ ,L_i such that 
at least [M/2J — 1 of them are visible. 

(iii) there exists a stretch of M consecutive bricks (say, La, La+i, . . . , La+M-i) from L_m+i, 

such that YZo r\S)>M. 

By (fT6l) . we have Pjevent (i)} = 0{n~'^^^^). We observe that the events (ii) and (iii) are 
both contained in the following event. 

(iv) there exists a stretch of M consecutive bricks (say, La, La+i, ■ ■ ■ , La+M-i) from 
L_^+2, ...,L.i such that E£o ' n 5) > [M/2J - 1. 

Again by (fT6|) . if we fix a position of such M consecutive blocks La, La+i, ■ ■ ■ , La+M-i and 
then fix s = [M/2J — 1 positions ji,j2,... within the blocks La, La+i, ■ ■ . , La+M-i, the 
probability that ji,j2, ■ ■ ■ ,js ^ S is bounded above by 0{n~'^''^^^) = 0{n^^). Hence by union 
bound, Pjevent (iv)} = 0{n{\ogn)'^^''^n~'^). This implies that the probability that either of 
the events (i) or (iv) happens goes to as n — )■ 00. This completes the proof. □ 

3.4 Reduction to a block diagonal form 

Let B be the following block diagonal form of the matrix P, 



P{k, I) if k,l e J for some J G A, 
otherwise. 



Ai(PD^P)- max Xi(P[J]B'[J]P[J]) 



B(A;,/) = 

Lemma 9. The following holds with high probability, 

■ ' 0(1). 

Proof. The maximum above equals Ai(BD^B), so the left hand side is bounded above by 

IIPD^P - BD^B||,p < ||(P - B)D^P||3p + ||BD^(P - B)||sp 

< ||(P - B)Disp||P||sp + ||B||,p||D^(P - B)||sp. 

Note that since P is a projection matrix ||P||sp = 1 and by block-diagonality we have 
||B||sp = maxjgA IIPI-'^] lisp- The matrix P[J] is just P conjugated by a coordinate projection, 
so it has norm at most 1. 
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Since = RD we first bound the spectral norm of (P — B)R. Tlie maximal column 
sum of this matrix is bounded above by the maximal absolute row sum of P, which by (IT^ 
is at most C2logn. Note that ((P — B)R)(A;,/) = unless k,l are in different parts of A. 
This gives the upper bound for the maximal absolute row sum 

which holds with high probability. We used the fact that with high probability each part in 
A has at most M elements j where R(/,/) is nonzero (Proposition [8]), and different parts 
have gaps of size (logn)^ in between them. Since the spectral norm is bounded above by the 
geometric mean of the maximal row and column sums, we get || (P— B)R||sp = 0((logn)^^/^). 

By Bernstein's inequality, we can find a constant C3 such that with high probability 
llDlLn = max, \dj\ is at most C-i^/\ogn. Therefore, ||(P - 3)0^^^ = o(l) with high 
probability. Similarly, ||D^(P — B)||gp = 0(1) with high probability, which yields the 
desired result. □ 



4 Proof of the upper bound 

Assume that oq, ai, . . . , a„ are independent mean zero, variance one random variables which 
are uniformly bounded by n^/'^ . This section consists of the proof of the lower bound 

Proposition 10. With high probability 

Ai(P[J]DnJ]P[J]) ^ „^„2 ^ 

max , < 112^4 + ol). 

JeA-.jnSj^d) a/2 log n 

Lemma 11. Fix k > 1 and 61,62, ■ ■ ■ ,6k > 0. Then for sufficiently large n and for any 
< Ji < J2 < • • • < jfc < n, we have 



Proof. Let \\6\\2 := {6l + . . . + 61^^ . To avoid triviality, assume that ||5||2 > 0. Clearly, 



P{|c?jJ > 6i^2\ogn,...,\dj^\ > 6k^/2\ogn} 

k 

^ E p{(EwO/ii'^ii2>ii^ii2V2b^}. 



fte{±i} i=i 



By Lemma ITSl for any choice of (3i G {±1}, the sum [l3i6idj-^ + ... + (3k6kdji.) /\\6\\2 has 
variance one and can be expressed as a linear combination independent random variables as 
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Yl^=o^i^i suitable real coefficients Oi with \6i\ < 2kn and Y17=o^i ~ ^- R-^call that 
< n^/"^ so that we can apply Bernstein's inequality to obtain 



P{( J] AM,,)/P||2 > ||5||2v/2b^} < exp 
1=1 ^ 



log?! 



+ ||5||2v/2b^-2A;ni/7-i/2/3 



for sufficiently large n. This completes the proof of the lemma. □ 

Lemma 12. Fix rj > 0, k > 1 and ^1,^2, ■ ■ ■ ,Sk > 0. Then there exists a constant C4 = 
C^i^T], k) such that for sufficiently large n and for any Q < ji < 32 < ■ ■ ■ < ju < n, we have 

P{|(ijJ > 61 a/2 log n, Idj^l > 62^/2 logn, . . . , \dji, \ > 5^ a/2 log n for some 
5i, . . . , 4 > such that 6f + 6^ + . . . + 6l > 1 + t]} < C^n-^^^'^^^K 

Proof. Construct an ^^j^^-net Af for the interval [0, l+t]] by choosing [^^^^^J + 1 equally 
spaced points in [0, 1 + including both endpoints and I + rj. Therefore, given any 
Si, ... ,6k G [0, 1+77], we can find ai, a2, . . . , G A/" such that ^1 — 4(1^^ < «i < (^i for each i. 
This implies that al+a2+. ■ -+(^1 > 1+77/2. Clearly, the event > 61 \J2 logn, . . . , \dj^\ > 

Sk\/2 logra for some 61, ... ,6k > such that 6f + 62 + . . . + Si > I + rj} is contained in the 
finite union of events > aiy/2 logn, . . . , \dji,\ > aky/2 logn} where the union is taken 

over for all possible choices of G A/" for all i such that af + a"^ + ■ ■ ■ + al > 1 + 7^/2. Now 
we use the union bound and Lemma [11] to conclude the probability of the given event is 
bounded by 2''+\i^Af)''n-^^+''^/^\ We complete the proof by taking C4 = 2^+i(#A/')^ □ 

Corollary 13. For every rj > 0, with high probability, for all admissible blocks L ^ C and 
all distinct ji, . . . ,jM & L we have d'j_^ + . . . + d'j^^ < 1 + ?7. 

Proof. For a fixed admissible block and points ji the probability that the claim is violated 
is at most C4n~*^^~*'''/^^ by the Lemma. By union bound, the probability that the claim is 
violated is at most n(*'(^°f ■ Cn-^^+^/^^ = ©((logn)^*^^-^/^). □ 

By part 2 of Proposition |8l with high probability, D*^ contains at most M nonzero entries 
in every admissible block, it follows that for each 77 > with high probability for all L G £ 
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Therefore, we have 

Ai(P[J]D^[J]P[J] 



max 



JeA:Jn57^0 a/2 log n 

< sup |Ai (p[L]Diag(5i, ^2, • • • , 5#L)P[i^]) : L G £, 5| < 1 + r/}. (17) 



i=i 



Now by dH!), we have max^g^ l|P[-^] ~ n[L]||sp = O ^ '■^"^"^ j . Therefore, if we take g 
g(r;,) := M [logn]^ and H = [1, q] fl Z, then f|T7|) can be bounded by 



m < sup I Ai (n[L]Diag(5i, 52,..., 5#L)nW) : L e C,J2j] < 1 + r]^ + O 
< (1 + r/)i/2 sup I Ai (n[i7]Diag(5i, ^2, • • • , 5,)n[//]) ■ Y.^' < + O (^^ 



(logn 



n 



72 



By Rayleigh's characterization of the maximum eigenvalue of Hermitian matrices, the above 
supremum equals 

q 

sup { (v, n[i7]Diag(5i, 6^, . . . , 5,)n[/J]v) : 5| < 1, v G CM| v||2 < l}. 

i=i 

Denote by 6 the infinite dimensional vector {. . . ,6-i,6o,Si, . . .) in £^(]R). Now extending the 
range of optimization we get the upper bound 

sup|(^v,nDiag(5)nv^ :5g£2(R),P||2< 1,vg£2(C),||v||2< l}. (18) 

Now note that (with denoting coordinate-wise multiplication) 

'v,nDiag(5)nv\ = (nv, Diag((5)nv\ = (5,n^0nv" 



so if we fix v, this is maximized when 6 equals nv0nv divided by its length. The maximum, 
for v fixed is ||nv nv||2 = ||nv|||, so the expression f|T8|) equals ||n||2_j.4. This completes 
the proof of Proposition [TOl 



5 Proof of the lower bound 

Assume that oq, ai, . . . , a„ are independent mean zero, variance one random variables which 
are uniformly bounded by n^^"'. This section consists of the proof of the lower bound. 
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Proposition 14. For r > there exists e > so that with high probability 




Let Gq, Gi, . . . , Gn be i.i.d. standard Gaussians independent of (ai)o<i< 



Recall 



1 



n-l 



( 



271 jk 



) 



dj{a) 



V2n 



V2ao + {-iyV2an + 2 ^ cos 



n 



0<j< In. 



fc=i 



We will use d.j[G) to refer to the above sum with being replaced by Gi for each < i < n. 
Suppose we are given some nonzero real numbers Ui,U2, ■ ■ ■ ,Uk for some fixed k > 1 such 
that + M2 + • • • + tJ'k < 1- Fix any ri > small such that ri < \ui\ for each i and if we set 
u'g := |-Us| -r] > for 1 < s < A;, then {u[Y + . . . + {u'f.f < 1. Take p = p{n) := Tflogn]^, 6 = 
:= 3[logn]^ and N = N{n) := \n/2p\. For 1 < i < A^, define the intervals 



And let Ai to be the event that dip+j G \/2 logn/j for all j = — 6 + 1, . . . , 6 + fc. 

Proposition 15. Let Ai be as above. Then as n ^ 00, the probability that at least one of 
the events ^i, A2, . . . , Ajv happens converges to one. 

Proof. The proof is based on the second moment method. First of all, fix any smooth 
function : M — )■ [0, 1] such that il){x) = for x < and iIj{x) = 1 for x G [l,oo). For 
any reals a < b, the indicator function of the interval {a^/2logn, b\/2 logn) can be bounded 
below by the smooth function 




l(aV21ogn,6V21ogn)(^) ^ C(o,6)(a^) :— l\}{x — a 




)ip{b\/2logn — x). 



For a fixed a < b, the first, second and third derivatives of the function (^a,b) ^-re all bounded 



by some constant in the supremum norm. Define, for each i in 1 < i < N 



b+k 




j=-b+l 



Clearly, for each i, < 1^^. Thus by the Paley-Zygmund inequafity. 
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So, the proposition follows if we can show IE[(^^]^ Wi)"^] = (1 + o{l)){KJ2iLi ^iY- The rest 
of the proof is devoted to establishing this. 

We will write to denote the random variable corresponding to f fT9|) with dj{G) in 
place dj{a) for all j. Recall that ((ij(G))o<j<n is a sequence of i.i.d. standard Gaussian 
random variables. Using the following well known Gaussian tail estimates, 



< P{Go > x\< =, a; > 



x2 + 1 V27r X V27r 

we obtain, for all large n 

b+k 



j=-b+i 7=1 Vvlogri-n". 

= Vt f (logn)-'^/2 . ^-(K)2+...+(„;^)2)\ ^ ^20) 



uniformly over 1 < z < A^, which implies that Y.i=i^^F 

— 7- oo as n — 7- oo. Since the 

random variables are bounded by 1, we have 

N N N N 

J^VarW^^ < Y^mwh'] <Y.^W^ = o{{Y,^W^f). (21) 

1=1 1=1 1=1 1=1 

For any 1 < i ^ i' < N , the two subsets of indices {ip — h + 1, . . . ,ip + k + h} and 
{i'p — 6 + 1, . . . , i'p + /c + 6} are disjoint. Hence, and are independent of each other 
and therefore, 

W^W^W^] = E[W[^]E[W^9]. (22) 

Now dUD and ([22]) yield 

N N 

n(J2 ^'')'] = (1 + E ^'')'- (23) 

1=1 i=l 

Our next goal is to show that the differences EWi - EWf and E[l^jl^i/] - E[iyj^W^?] are 
of smaller order for each i ^ i' using the invariance principle given in Lemma [T71 

We apply Lemma [T71 with r = n + l,m = A; + 26, X = (oq, . . . , a„), Y = {Gq, . . . , G„), / = 
{dip-b+i, • . • , dip+k+b) and 

5(z)= n C/,(2;6+i) 
j=~b+i 

where z = (2:1, 2:2, ... , 2;2fe+fe) to obtain 

19 



where Rj and Tj are as defined in Lemma [T71 We will first find an upper bound for 1E[-Rj]. 
The bound for E[Tj] will be similar. Note that H^^/jHoo < 2n^^^'^ for each i and t. Since 
each function ft is linear, its higher derivaties all vanish. Also, \\d£d£>di'/g\\oo = 0(1) for each 
i, i' and i" and dgde'dgng^z) ^ only if z^+j G ^2 logra/j for j = 1 . . . fc. For a; G M, define a 
random vector Z(-'^(a;) := (oq, • • • , cij-2, a;, Gj, . . . , Therefore, we have 



CTtV 

Ei?, < — -E 



n 



3/2 



sup l{rfip+,(Z(-')(x)) G V21ogn/„ 1 < s < A;}] 



x:|x|<|aj_i| 



(25) 



Note that dj depends on aj_i linearly with absolute coefficient at most \j2/n, and that the 
random variable dj-i is bounded by n^/'^ . Thus the supremum above is at most 



1 |d,p+,(Z(^)(0))| > {\us\-v)^2\ogn-V2n 



1/7-1/2 



l<s<k\ 



which is independent of aj_i. Since Ea|_^ = 1, we have, with u'^ = \us\ — f] 



m < ^^Fl^^{\d^p+si'Z'^'\0))\ > <v/21^ - 1 < s < fc} (26) 

Note that if we truncate the random variables Gj, . . . Gn at level n^^''', then we can bound 
( 126|) using Bernstein's inequality exactly as we did in proving Lemma [TTl Towards this end, 
we define 

Z^-^^ = (ao, • ■ • , «i-2, 0, G'j,-l||Q^,|<„l/7}, . . . , Gnl{|G„|<nl/7})- 



Then 



21 < 



cm 



P {l d,p+,(Z(^))| > <V21ogn - V2n^'^~^l\ 1 < s < A;} 



^3/2-1/7 

+ ¥{\G(\> n^l^< for some £}] 
< 



cm 



0(n-(W)^+-+K)^)) + o (nexp (-n2/V2)) 



^1/2-1/7 

Hence, by combining (125]) . and fl27|l . we obtain 



(27) 



5^E[i?,] <0((log 



^n9^1/7-1/2 . ^-((«;)2+...+(n;)2) 



where the constant hidden inside the big-0 notation above does not depend on i. Similar 
computation yields the same asymptotic bound for X]^=i^K]- Therefore, by fl24|) and (120|) . 
we have 



rG^ , ^^n^^^^Vi/7-1/2 .^-(K)2+...+K)2) 



EflVi] = E[H/^^] + 0((logn)^ni 



) = (l + o(l))E[W^, 



■^1 



(28) 
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which hold uniformly in 1 < i < A^. By similar argument as above, we can also show that 
W.[W,W,.] =E[W^^W^?]+0((logn)9n^/^-i/2.^-2{K)2+...+K))2^ ^ (1 + o(l))E[W^^l^?], (29) 

uniformly 'ml <i ^ i' < N . From (1281) arguing similarly as we did in (l^T]) . we deduce 

N N 
i=l i=l 

Finally, combining fl22|) . f l29|) and f l30|) together, we have 



N f ^ \ 

Y,m^w,] = {l+o{l))\y^nm] 

i,i'=l \i=l J 



This implies that P| 'Ylii=i > l} = 1 — o(l) which completes the proof. □ 

For any finite k> 1, we write 11^ as a shorthand for k^k matrix n[{l, 2, . . . , A;}]. Arguing 
along the hue of the proof of the fact ||n||2_^4 = sup{(v, ndiag(5)nv) : 5 e f(R), \\6\\2 < 
1, V G ^^(C), ||v||2 < 1}, we can also show that 

l|nfc||L4 = sup{Ai(nfcdiag((5)nfc) -.SeR', WSh < i}. (3i) 

Next we prove 

Lemma 16. limfe^oo ||nA:||2^4 = ||n||2^4- 

Proof. Since the operators ip, ip"^, X[o,i/2] are all bounded and the inclusion map i : £^(C) — )■ 
£^(C) is also a bounded operator, we have ||n||2-s>4 < oo. 

It will be convenient to think of Il.2k+i as a linear operator acting on the space £^(C) with 
the representation Yi2k+i{h3) = n(i, j) for |j| < k and otherwise. Clearly, || 11^11 2^4 is 
increasing and bounded above by ||n||2_s.4. 

For the other direction. Consider a sequence of unit vectors G £^(C) supported on 
[—771,171] SO that ||nvm||4 — )■ ||n||2^4. Then for > m we have 

4 



|nv.m||4 - ||n2fc+lVm||4 = E |nv.m(i)|^= ^ 

i:\i\>k i:\i\>k 



J2 n(^,j)v^(j) 



Since |n(i, j)| < \i — j\ ^ and |vm(j)| < 1, the inside sum is bounded above by (2m + l)(i — 
m)~^. This gives the upper bound 

(2m + 1)^ j^^O, as fc->cx). 

e:\e\>k-m 

Letting k ^ 00 and then m — )■ 00 completes the proof. □ 
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Given r > 0, by ( 13T|) and Lemma [161 "we can find k > 1 sufficiently large and a vector 
u = M2, . . . , Mfc) G M'^ with ||u||2 < 1 such that Ai(nfcdiag(u)nfc) > ||n||2_>4 — t/2. By 
perturbing the coordinates a little, if necessary, we can also assume Us for each 1 < s < k. 
Now choose rj > sufficiently small such that 

1.0^ {us — rj,Us + rj) for each 1 < s < k. 

2. If V = {vi,V2, ...,Vk) G [ui-T],ui + r]]x . . . X [uk-T],Uk + T]], then Ai(nA;diag(v)nfc) > 

l|n||L4-r. 

3. sup{||v||2 : V e [ui - r],ui + 7]] X . . . X [uk - V^Uk + i]]} < 1. 

Finally choose e > small such that [— e, e] fl {ug — ri,Us + rj) = ^ for each 1 < s < k. 

By Proposition [T5] we know that one of the events Ai,l < i < N happens with high 
probability. If Ai occurs, then the points ip — [logra]^, . . . ,ip + k + [logra]^ is contained in 
a single partition block J G A. This is because when Ai occurs, dip+j G {us — ri,Us + t]) 
and hence |(ijp+j| > e for each j = 1, . . . , k and the property of our random partition which 
guarantees that each (random) partition block always has a padding of two invisible bricks 
of length [log n] ^ from each side. On the other hand, since two consecutive empty bricks can 
not belong to the same partitioning block, J has no other point from S except the k points 
ip + l,ip + 2, . . . ,ip + k. Write F := {ip + l,ip + 2, . . . ,ip + k}. Therefore, if A^ happens, 
then 

Ai(P[J]DnJ]P[J]) ^ Ai(P[F]D-[F]P[F]) 
a/2 logn ~ a/2 logn 

> inf |Ai(^P[F]diag(v)P[F]j : v e [ui - r],Ui + r]] x . . . x [uk - V,Uk + v]] 



By the convergence (IMl) of P to 11 this equals 

inf |Ai (^nfcdiag(v)nfc j : v G [ui - rj , Ui + f]] x ... x [uk - ■n,Uk + r]]^ - 0{n'^] 

>l|nfc||L4-^-o(n-i). 

So, by Lemma [16] we get with high probability, 
Ai(P[J]D^[J]P[J]) 



max 



JeAiJnS^O a/2 log n 
This yields the claim of Proposition [TH 



> l|nfc||L4 -r- 0{n-^) = ||n||2^4 - r - o(l). 
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6 Appendix 



6.1 Invariance Principle 

Let X = {Xi, . . . , Xr) and Y = {Yi, . . . , Y^) are two vectors of independent random variables 
with finite second moments, taking values in some open interval / and satisfying, for each 
i, E[Xi] = E[Yi\ and E[Xf] = E[Y^^]. We shall also assume that X and Y are defined on 
the same probability space and are independent. The following lemma is an immediate 



generalization of Theorem 1.1 of IChatterjed (120051 ) (based on Lindeberg's approach to the 



CLT). We need this more detailed version because we will use the invariance principle under 
the moderate deviation regime. 

Lemma 17. Let / = (/i, /2, . . . , fm) '■ IR™" be thrice continuously differentiable. If we 

set U = /(X) and V = /(Y), then for any thrice continuously differentiable g : M™" — )• M, 

r r 

|E[^?(U)] -E[(7(V)]| < Y,E[R,] + Y,E[T,], 

1=1 1=1 

where 

1 



Ri '■— -^l^il^ X sup |/ij(Xi, . . . , Xj_i, X, . . . , Fr, 

xe[mm{0,Xi),max{0,Xi)] 

Ti:=^|Fi|^x sup \hi{Xi,...,Xi_i,y,Yi+i,...,Yr)\, 

D ye[mm{0,Yi),tiia.x{0,Yi)] 



/i,(x) := J2 W.^7(/(x))5i/£(x)a,/,(x)a,/,(x) + 3 J] 9,a,^(/(x))af/,(x)a,/,(x) 

e,p,q=l i,p=l 
m 

+ ^a,^(/(x))af/,(x). 



Proof. The lemma can be proved by merely imitating the steps of lChatterjed (120051 Theorem 
1.1) but we include a proof here for sake of completeness. Let if : f — )■ M be the function 
H{x.) := (^(/(x)). It is a routine computation to verify that 9f/f(x) = hi{x) for all i. 
For < i < r, define the random vectors Zj := (Xi, X2, . . . , Xj, Fj+i, . . . , y^) and 
Wi := (Xi, X2, . . . , Xj_i, 0, Fj+i, . . . , Yn) with obvious meanings for z = and i = r. For 
1 < z < r, define 



and 



Errorf ) = H{Z,) - XAHiW^) - ^Xfm(W, 



Errorf = i/(Z,_i) - YAH{W,) - ^Y,^diH{W, 
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Hence, by Taylor's remainder theorem and the above observation about the third partial 
derivatives oi H, it follows that 

|Errorj-^''| < Ri and lErrorp-*! < Tj. 

For each i, Xi, Yi and Wj are independent. Hence, 

E[X,9,//(W,)] - E[y,9,i/(W,)] = E[X, - y,] • E[//(W,)] = 0. 

Similarly, ¥\XfdiH{Wi)] = W\Y^diH{Wi)]. Combining all these ingredients, we obtain 

iE[y(u)] - E[^(v)]i = I J] {mm - E[//(z,_o]) 

1=1 

< I ^ E (W,)] + W,) + Error«] 

i=l 

^ 1 

-Y,^[yAH{^i)] + -Fi'aii/(W,) + Errorf ] 
1=1 

r r 

5^E[i?,] + 5^E[T,], 

i=l i=l 

which completes the proof. □ 

6.2 Covariances between the eigenvalues of random circulant 

Lemma 18. Let ao,ai, . . . ,an be independent mean zero, variance one random variables. 
Define 




0<j< 2n. 



Then dj — d2n-j for < j < 2n. Moreover, the random variables dj,0 < j < n have mean 
and their covariances are given by 

f 2 zf j = ke{0,n}, 
E[djdk] = I 1 if < j = k < n, 

[ if 0<j^k<n. 
Proof. The fact that dj = d2n~j for < j < 2n and zero mean property is immediate from 
the definition of dj. Since be independent with variance one. 



E[d,dk] = 
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^ /27rm^\ l + (-l)' 



Plugging in x = m = 0, 1, 2, . . . , 2n in the well known Dirichlet kernel formula 

,n X sin((n — l/2)a;) 

we obtain 

Ecos I I 

1=1 ^ ^ 

unless m = or 2n when the sum equals to (n — 1). Using the above formula, the covariances 
E[(ij(ifc] can be easily computed. □ 

6.3 Optimisation problem and connection to the Sine kernel 

For any 5 > 0, let be the indicator function l[_5/2,(5/2]- For a complex valued function / 
defined on R, we define the involution /* by = f{—x). Given two functions / and g 

on R their convolution ficg is defined as {ficg){x) — f{x — y)g{y)dy provided the integral 

1/9 ~ 

makes sense. Let f~kjg(x) — J_[/2 fi^~y)9(y)dy denote the convolution of the two functions 
/ and g are in L^(T) where / and g are the periodic extension of / and g respectively on the 
whole real hne. Let f{t) — J^e~'^'^^^*f{x)dx be the usual Fourier transform of / e L^(R). 
Recall that is the discrete Fourier transform of from L^(T) to £^(C). Below we collect 
some basic facts of the usual and discrete Fourier transform which we will need later. 

1. ijj-^ : L2(T) ^ £2(C) and ^ L'^{R) L'^{R) are isometries. 



2. ^-i(/) = V'-Hr) and/ = /*. 

3. i^-^{f in g){k) = ilj-\f){k)7p-\g){k) for all A; e Z and for all f,g e L^{T). When 
f,geL\T),thenfirg^f-g. 

4. If / and g are supported on [0, 1/2] then f *t 9 — f 9- 

5. v^{t) = ^ 

Note that Sin(/)(x) = vi for / e L^(]R). The next lemma establishes the connection 

between the 2 — > 4 norm of the operator H and the 2 — > 4 norm of the integral operator Sin. 

Lemma 19. The following holds true. 

I|n||2^4 = ^^||Sin||2^4. 
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Let us define for 5 > 0, 

Ks:=snp[\\{fvsr*{fvs)\\2:feL' 
Lemma 20. Let Kg be as above. Then Kg = 
Proof of Lemma For any f e L"^ (M) , 



<1 . 



(32) 



Wifvsr^ifvs) 



fvsix + t)fvsit)dt 



dx 



After a change of variables s = t/6 and y = x/6 the above integral is same as 



fvs{6{s + y))fvs{6s)ds 



dy. 



(33) 



Keeping in mind that vs{Ss) = vi{s) and replacing / by g{x) := 5^^"^ f{5x) in ( l33|) . we obtain 



m = 5 



gvi{{s + y))fvi{s)ds 



dy. 



Since \\g\\2 = ll/lh, it follows that Kg = 6Kf which completes the proof of the lemma. □ 

Proof of Lemma [73 The proof consists of a series of elementary steps. Let k be the indicator 
function 1 [0.1/2]- Applying the Fourier transform from £^(Z) to L^(T), we get 

||n||^^4 = sup I ||n^ nv||2 : v g f (z), || v||2 < i 



snp<,\\^-\f-t,)Q^-\f-K)h:feL\T), 
The properties of the Fourier transform further imply 



< 1 



and since we convolve functions supported on [0, 1/2] we might as well do the entire opti- 
mization on the real line to get 



ini|2 



snp[\\{f-Kr^{f-K)h:feL' 



< 1 



By translating the function / G L^(]R) via the map / ^ /(■ + 1/4) in the above optimization 
problem, we see that ||n||2^4 = K1/2. This equals Ki/\/2 by Lemma [201 Now note that 



ll(M)*^(M)l|2 = ||M-MI|2 = IIMI 



|Sin(/)||^ 



and so 

K, = snp{\\S±n{f)\\l:feL' 
This completes the proof of the lemma. 



< 1 



ISinl 



2^.4- 



□ 
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As far as we know, the explicit value of constant Ki of (l32l) is not known in the literature- 



Howe ver, the maximization problem given in fl32l) has been studied in 
(jl968/69f ). We list below some of the interesting results from 

2 \ 1/2 



Garsia. Rodemich and Rumsey 



Garsia et al 



(11968 /69f ). 



Ki = sup I ^ ^ /m /(^ + t)f{t)dt^ dx^ : / G -T-"! where J-" is the class of all real- 
valued functions / satisfying /(x) > 0, /(x) = /(— x) for all x G M, f{x) > f{y) for 

1 /2 

< X < y and /(x) = for |x| > 1/2 and /_^/2 = 1- 



There exists a unique j ^ T such that ^ Jj^ ^ /m /(^ + 't)f{'t)dt^ dx 
Kf = 0.686981293033114600949413 . . . ! 
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